AITopics | language model adaptation

Collaborating Authors

language model adaptation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Language Model Adaptation to Specialized Domains through Selective Masking based on Genre and Topical Characteristics

Belfathi, Anas, Gallina, Ygor, Hernandez, Nicolas, Dufour, Richard, Monceaux, Laura

arXiv.org Artificial IntelligenceFeb-26-2024

Recent advances in pre-trained language modeling have facilitated significant progress across various natural language processing (NLP) tasks. Word masking during model training constitutes a pivotal component of language modeling in architectures like BERT. However, the prevalent method of word masking relies on random selection, potentially disregarding domain-specific linguistic attributes. In this article, we introduce an innovative masking approach leveraging genre and topicality information to tailor language models to specialized domains. Our method incorporates a ranking process that prioritizes words based on their significance, subsequently guiding the masking procedure. Experiments conducted using continual pre-training within the legal domain have underscored the efficacy of our approach on the LegalGLUE benchmark in the English language. Pre-trained language models and code are freely available for use.

computational linguistic, ecthr, language model, (12 more...)

arXiv.org Artificial Intelligence

2402.12036

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Ontario > Toronto (0.05)
Europe > France > Pays de la Loire > Loire-Atlantique > Nantes (0.04)
(8 more...)

Genre: Research Report (1.00)

Industry: Law (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Visual Comparison of Language Model Adaptation

#artificialintelligenceAug-19-2022, 03:56:41 GMT

Neural language models are widely used; however, their model parameters often need to be adapted to the specific domains and tasks of an application, which is time- and resource-consuming. Thus, adapters have recently been introduced as a lightweight alternative for model adaptation. They consist of a small set of task-specific parameters with a reduced training time and simple parameter composition. The simplicity of adapter training and composition comes along with new challenges, such as maintaining an overview of adapter properties and effectively comparing their produced embedding spaces. To help developers overcome these challenges, we provide a twofold contribution.

explanation method, language model adaptation, visual comparison, (2 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language (0.64)

Add feedback

Joint and Coupled Bilingual Topic Model Based Sentence Representations for Language Model Adaptation

Lu, Shixiang (Institute of Automation, Chinese Academy of Sciences) | Fu, Xiaoyin (Institute of Automation, Chinese Academy of Sciences) | Wei, Wei (Institute of Automation, Chinese Academy of Sciences) | Peng, Xingyuan (Institute of Automation, Chinese Academy of Sciences) | Xu, Bo (Institute of Automation, Chinese Academy of Sciences)

AAAI ConferencesAug-3-2013

This paper is concerned with data selection for adapting language model (LM) in statistical machine translation (SMT), and aims to find the LM training sentences that are topic similar to the translation task. Although the traditional approaches have gained significant performance, they ignore the topic information and the distribution information of words when selecting similar training sentences. In this paper, we present two bilingual topic model (BLTM) (joint and coupled BLTM) based sentence representations for cross-lingual data selection. We map the data selection task into cross-lingual semantic representations that are language independent, then rank and select sentences in the target language LM training corpus for a sentence in the translation task by the semantics-based likelihood. The semantic representations are learned from the parallel corpus, with the assumption that the bilingual pair shares the same or similar distribution over semantic topics. Large-scale experimental results demonstrate that our approaches significantly outperform the state-of-the-art approaches on both LM perplexity and translation performance, respectively.

bilingual topic model, language model adaptation, sentence representation

AAAI Conferences

Twenty-Third International Joint Conference on Artificial Intelligence

Genre: Research Report (0.53)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback